TopFIND 2.0—linking protein termini with proteolytic processing and modifications altering protein function
Abstract
Protein termini provide critical insights into the functional state of individual proteins. With recent advances in specific proteomics approaches to enrich for N- and C-terminomes, the global analysis of whole terminomes at a proteome-wide scale is now possible. Information on the actual N- and C-termini of proteins in vivo and any post-translational modifications, including their generation by proteolytic processing, is rapidly accumulating. To access this information, we present version 2.0 of TopFIND (http://clipserve.clip.ubc.ca/topfind), a knowledgebase for protein termini, terminus modifications and underlying proteolytic processing. Built on a protein-centric framework, TopFIND covers five species (Homo sapiens, Mus musculus, Arabidopsis thaliana, Saccharomyces cerevisiae and Escherichia coli) and incorporates information from curated community submissions, publications, UniProtKB and MEROPS. Emphasis is placed on the detailed description and classification of evidence supporting the reported identification of each cleavage site, terminus and modification. A suite of filters can be applied to select supporting evidence. A dynamic network representation of the relationships between proteases, their substrates and inhibitors, together with visualization of protease cleavage site specificities, complements the information displayed. Hence, TopFIND supports in-depth investigation of protein termini information to spark new hypotheses on protein function by correlating cleavage events and termini with protein domains and mutations.
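The idea of attaching classified evidence to each terminus and then filtering on it can be illustrated with a minimal sketch. This is not TopFIND's schema or API: the class names, fields (`source`, `method`, `direct`), the `filter_termini` helper and the example records are all hypothetical, chosen only to show how a protein-centric record with evidence filters might look.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Evidence:
    """One piece of supporting evidence (hypothetical fields, not TopFIND's schema)."""
    source: str   # e.g. "UniProtKB", "MEROPS", "community submission"
    method: str   # free-text label for how the terminus was identified
    direct: bool  # assumed flag: experimentally observed (True) vs inferred (False)


@dataclass
class Terminus:
    """A protein terminus with its list of supporting evidence."""
    protein: str   # placeholder protein identifier
    position: int  # residue position of the terminus
    kind: str      # "N" or "C"
    evidences: list = field(default_factory=list)


def filter_termini(termini, *, sources=None, direct_only=False):
    """Keep termini that retain at least one evidence passing every active filter."""
    def passes(ev):
        if sources is not None and ev.source not in sources:
            return False
        if direct_only and not ev.direct:
            return False
        return True

    kept = []
    for t in termini:
        matching = [ev for ev in t.evidences if passes(ev)]
        if matching:
            # Return a copy carrying only the evidence that passed the filters.
            kept.append(Terminus(t.protein, t.position, t.kind, matching))
    return kept


# Made-up example records, for illustration only.
termini = [
    Terminus("PROT_A", 1, "N", [Evidence("UniProtKB", "inferred", False)]),
    Terminus("PROT_A", 42, "N", [
        Evidence("MEROPS", "cleavage assay", True),
        Evidence("UniProtKB", "inferred", False),
    ]),
    Terminus("PROT_B", 310, "C", [Evidence("community submission", "terminomics", True)]),
]

direct = filter_termini(termini, direct_only=True)
merops = filter_termini(termini, sources={"MEROPS"})
```

In this sketch a terminus is dropped only when none of its evidence survives the filters, so a site supported by both inferred and experimental evidence remains visible under the stricter filter.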
Similar resources
Proteome TopFIND 3.0 with TopFINDer and PathFINDer: database and analysis tools for the association of protein termini to pre- and post-translational events
The knowledgebase TopFIND is an analysis platform focussed on protein termini, their origin, modification and hence their role in protein structure and function. Here, we present a major update to TopFIND, version 3, which includes a 70% increase in the underlying data to now cover 90,696 proteins, 165,044 N-termini, 130,182 C-termini, 14,382 cleavage sites and 33,209 substrate cleavages in H...
Molecular detection of proteolytic activity of human parechovirus 2A protein by gene expression
Parechoviruses form one of the nine genera in the Picornaviridae family, and include two human pathogens: human parechovirus type 1 and type 2 (Hpev1 and Hpev2). The genome of picornaviruses encodes a single polyprotein, which undergoes a cleavage cascade performed by virus-encoded proteases to give the final virus proteins. The primary cleavage is carried out by the 2A protein, and this step is critical for vi...
The path of no return—Truncated protein N‐termini and current ignorance of their genesis
Almost all regulatory processes in biology ultimately lead to or originate from modifications of protein function. However, it is unclear to which extent each mechanism of regulation actually affects proteins and thus phenotypes. We assessed the extent of N-terminal protein truncation in a global analysis of N-terminomics data and find that most proteins have N-terminally truncated proteoforms....
Global profiling of protease cleavage sites by chemoselective labeling of protein N-termini.
Proteolysis has major roles in diverse biologic processes and regulates the activity, localization, and intracellular levels of proteins. Linking signaling pathways and physiologic processes to specific proteolytic processing events is a major challenge in signal transduction research. Here, we describe N-CLAP (N-terminalomics by chemical labeling of the alpha-amine of proteins), a general appr...
TAILS N-terminomics of human platelets reveals pervasive metalloproteinase-dependent proteolytic processing in storage.
Proteases, and specifically metalloproteinases, have been linked to the loss of platelet function during storage before transfusion, but the underlying mechanisms remain unknown. We used a dedicated N-terminomics technique, iTRAQ terminal amine isotopic labeling of substrates (TAILS), to characterize the human platelet N-terminome, proteome, and posttranslational modifications throughout platel...